Search CORE

3 research outputs found

Sharing large data collections between mobile peers

Author: Foley Christopher Eric
Gourlay Richard Scott
Tripney Brian Grieve
Wilson John
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2009
Field of study

New directions in the provision of end-user computing experiences mean that we need to determine the best way to share data between small mobile computing devices. Partitioning large structures so that they can be shared efficiently provides a basis for data-intensive applications on such platforms. In conjunction with such an approach, dictionary-based compression techniques provide additional benefits and help to prolong battery life

Crossref

University of Strathclyde Institutional Repository

Enlighten

Data value storage for compressed semi-structured data

Author: Tripney Brian Grieve
Publication venue
Publication date: 01/01/2012
Field of study

Growing user expectations of anywhere, anytime access to information require new types of data transfer to be considered. While semi-structured data is a common data exchange format, its verbose nature makes les of this type too large to be transferred quickly, especially where only a small part of that data is required by the user. There is consequently a need to develop new models of data storage to support the sharing of small segments of semi-structured data as existing XML compressors require the transfer of the entire compressed structure as a whole. This thesis examines the potential for bisimilarity-based partitioning (i.e. the grouping of items with similar structural patterns) to be combined with dictionary compression methods to produce a data storage model that remains directly accessible for query processing whilst facilitating the sharing of individual data segments. The use of dictionary compression is shown to compare favourably against Hu mantype compression, especially with regard to real world data sets, while a study of the e ects of di ering types of bisimilarity upon the storage of data values identi ed the use of both forwards and backwards bisimilarity as the most promising basis for a dictionary-compressed structure. Having employed the above in a combined storage model, a query strategy is detailed which takes advantage of the compressed structure to reduce the number of data segments that must be accessed (and therefore transferred) to answer a query. A method to remove redundancy within the data dictionaries is also described and shown to have a positive e ect in terms of disk space usage.Growing user expectations of anywhere, anytime access to information require new types of data transfer to be considered. While semi-structured data is a common data exchange format, its verbose nature makes les of this type too large to be transferred quickly, especially where only a small part of that data is required by the user. There is consequently a need to develop new models of data storage to support the sharing of small segments of semi-structured data as existing XML compressors require the transfer of the entire compressed structure as a whole. This thesis examines the potential for bisimilarity-based partitioning (i.e. the grouping of items with similar structural patterns) to be combined with dictionary compression methods to produce a data storage model that remains directly accessible for query processing whilst facilitating the sharing of individual data segments. The use of dictionary compression is shown to compare favourably against Hu mantype compression, especially with regard to real world data sets, while a study of the e ects of di ering types of bisimilarity upon the storage of data values identi ed the use of both forwards and backwards bisimilarity as the most promising basis for a dictionary-compressed structure. Having employed the above in a combined storage model, a query strategy is detailed which takes advantage of the compressed structure to reduce the number of data segments that must be accessed (and therefore transferred) to answer a query. A method to remove redundancy within the data dictionaries is also described and shown to have a positive e ect in terms of disk space usage

STAX (Strathclyde Repository)

Data value storage for compressed semi-structured data

Author: Ross Isla
Tripney Brian Grieve
Wilson Francis
Wilson John
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Growing user expectations of anywhere, anytime access to information require new types of data representations to be considered. While semi-structured data is a common exchange format, its verbose nature makes files of this type too large to be transferred quickly, especially where only a small part of that data is required by the user. There is consequently a need to develop new models of data storage to support the sharing of small segments of semi-structured data since existing XML compressors require the transfer of the entire compressed structure as a single unit. This paper examines the potential for bisimilarity-based partitioning (i.e. the grouping of items with similar structural patterns) to be combined with dictionary compression methods to produce a data storage model that remains directly accessible for query processing whilst facilitating the sharing of individual data segments. Study of the effects of differing types of bisimilarity upon the storage of data values identified the use of both forwards and backwards bisimilarity as the most promising basis for a dictionary-compressed structure. A query strategy is detailed that takes advantage of the compressed structure to reduce the number of data segments that must be accessed (and therefore transferred) to answer a query. A method to remove redundancy within the data dictionaries is also described and shown to have a positive effect on memory usage

University of Strathclyde Institutional Repository

Enlighten